Signature Files: An Integrated Access Method for Formatted and Unformatted Databases
Abstract
The signature file approach is one of the most powerful information storage and retrieval techniques, used for finding the data objects that are relevant to user queries. The main idea of all signature-based schemes is to reflect the essence of the data items into bit patterns (descriptors or signatures) and store them in a separate file, which acts as a filter to eliminate the non-qualifying data items for an information request. It provides an integrated access method for both formatted and unformatted databases. A comparative overview and discussion of the proposed signature generation methods and the major signature file organization schemes are presented. Applications of the signature techniques to formatted and unformatted databases, single and multiterm query cases, serial and parallel architectures, and static and dynamic environments are provided, with a special emphasis on multimedia databases, where the pioneering prototype systems using signatures yield highly encouraging results.
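The filtering idea in the abstract can be illustrated with a minimal sketch of one common signature generation method, superimposed coding. This is an illustration under stated assumptions, not the paper's own scheme: the signature width, the number of bits set per word, the SHA-256 based hashing, and all function names are hypothetical choices made for the example.

```python
import hashlib

SIG_BITS = 64       # assumed signature width in bits
BITS_PER_WORD = 3   # assumed number of bits each word sets


def word_signature(word: str) -> int:
    """Hash one word to a bit pattern with up to BITS_PER_WORD bits set."""
    sig = 0
    for i in range(BITS_PER_WORD):
        digest = hashlib.sha256(f"{i}:{word.lower()}".encode()).digest()
        pos = int.from_bytes(digest[:4], "big") % SIG_BITS
        sig |= 1 << pos
    return sig


def record_signature(text: str) -> int:
    """Superimpose (bitwise OR) the signatures of all words in a record."""
    sig = 0
    for word in text.split():
        sig |= word_signature(word)
    return sig


def search(records, signatures, query):
    """Filter with the signature file, then verify the surviving candidates."""
    q = record_signature(query)
    terms = [t.lower() for t in query.split()]
    hits = []
    for text, sig in zip(records, signatures):
        if sig & q != q:
            continue  # some query bit is missing: the record cannot qualify
        # The signature matched, but this may be a false drop, so the
        # record itself is checked before it is returned.
        words = {w.lower() for w in text.split()}
        if all(t in words for t in terms):
            hits.append(text)
    return hits


# Hypothetical sample data: the "text file" and its separate "signature file".
records = [
    "signature files filter records",
    "inverted index for text",
    "bit signature of a document",
]
signatures = [record_signature(r) for r in records]
```

A record whose signature lacks any query bit is discarded without being read. A record that passes the bit test is only a candidate, since superimposing can set all of the query bits by coincidence, so the sketch confirms each candidate against the stored text.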
Related resources
The HOOKAH Information Extraction System
The focus of Project HOOKAH is to improve the processing of the DEA-6 report, a semi-formatted report generated primarily by field agents, as well as legal staff, analysts, and others. DEA-6s are organized into case files, and are composed of multiple sections with varying amounts of formatting. Header fields are normally highly formatted, and indicate the subject, case, date, time, etc. There ...
Design of a Signature File Method that Accounts for Non-Uniform Occurrence and Query Frequencies
In this paper we study a variation of the signature file access method for text and attribute retrieval. According to this method, the documents (or records) are stored sequentially in the “text file”. Abstractions (“signatures”) of the documents (or records) are stored in the “signature file”. The latter serves as a filter on retrieval: it helps discard a large number of non-qualifying documen...
A Superimposed Coding Scheme Based on Multiple Block Descriptor Files for Indexing Very Large Data Bases
A new signature file method for accessing information from large data files containing both formatted and free text data is presented. The new method, called the multiorganizational scheme, is proposed for indexing very large data files containing hundreds of thousands or possibly millions of records.
An integrated genetic data environment (GDE)-based LINUX interface for analysis of HIV-1 and other microbial sequences
MOTIVATION: Sequence databases encode a wealth of information needed to develop improved vaccination and treatment strategies for the control of HIV and other important pathogens. To facilitate effective utilization of these datasets, we developed a user-friendly GDE-based LINUX interface that reduces input/output file formatting. DESIGN AND RESULTS: GDE was adapted to the Linux operating syste...
A Method for Protecting Access Pattern in Outsourced Data
Protecting the information access pattern, which means preventing the disclosure of data and structural details of databases, is very important in working with data, especially in the cases of outsourced databases and databases with Internet access. The protection of the information access pattern indicates that mere data confidentiality is not sufficient and the privacy of queries and accesses...
Publication date: 2008